Exact results and scaling properties of small-world networks 
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We study the distribution function for minimal paths in small- world networks. Using properties of 
this distribution function, we derive analytic results which greatly simplify the numerical calculation 
of the average minimal distance, 1, and its variance, a^. We also discuss the scaling properties of 
the distribution function. Finally, we study the limit of large system sizes and obtain some analytic 
results. 

PACS numbers: 05.10.-a,05.40.-a,05.50.+q,87.18.Sn 

Recently Watts and Strogatz Q have studied a class 
of networks which can be 'tuned' from an ordered to a 
random state by varying a single parameter. For a range 
of parameter values, they found that the networks resem- 
bled ordered networks locally but random networks glob- 
ally. They named this class of networks "small-world" 
networks because of their relevance to a well-known prob- 
lem in sociology ||^ . Since their introduction, small- world 
networks and their properties have received considerable 
attention, |4|-[l8|, in part because of their possible appli- 
cation to a broad range of systems, ranging from social 
networks |^ to coupled oscillators 

Much of the work characterizing the properties of 
small-world networks has focused on the average mini- 
mal distance, i, separating two randomly chosen points 
in the network. Previous work has shown that i has two 
scaling regimes: for small systems sizes L, it is found 
that i ^ L, whereas for large L, Z ~ log(L), A 
scaling form for £ has been proposed and numerically 
confirmed; however the nature of the crossover between 
the two scaling regimes has been the subject of debate 
P,p| pO|Jl^ , p^ , p^ . In this work, we will focus on some 
basic probability distributions for small-world networks, 
and as a consequence obtain an exact result for i which 
greatly aids its numerical computation. 

We generate the networks following the prescription of 
Newman and Watts ||] . We start with a 1-d periodic lat- 
tice with L = 2N sites and nearest neighbor connections. 
We then add short-cuts uniformly with probability p such 
that the average number of short-cuts added is x = pL. 
We denote the distance between two sites, counted along 
the lattice using only nearest neighbor links, as the Eu- 
clidean distance. By contrast, the shortest distance be- 
tween two sites, counted along any bond including short- 
cut bonds, is called the minimal distance. 

Using these definitions, we now introduce the following 
probability functions: (i) P(n\m), the probability that 
two sites are separated by Euclidean distance n given that 
their minimal distance is m; (ii) P{m\n), the probability 
that two sites have minimal separation m given that their 
Euclidean distance is n; and (iii) Q{m), the probability 
that two randomly chosen sites have a minimal separa- 



tion m. Recently, Dorogovtsev et al have introduced 
two exactly solvable models similar to small-world net- 
works. For these models they derive the explicit form of 
P{m\n), from which they obtain £ and other properties 
of their networks. In this Letter, we derive the general 
form of P{rn\n) for small-world networks, and we confirm 
it numerically. Using this form, we derive an exact ex- 

2 

pression for I and for the variance oi I, a"^ = P — t . We 
also study the scaling properties of P{m\n) and obtain 
some approximate results for it in the limit of large L. 
Note that in describing the networks, we have considered 
the case of coordination number of 2fc = 2 for each site. 
However, our arguments for the general form of P{m\n) 
are valid for arbitrary k. For simplicity we will consider 
the case fc = 1 in the rest of this Letter and generaliza- 
tions to arbitrary k will be indicated as appropriate. 

We begin by deriving the general form of P{n\m). 
First, since the minimal distance cannot exceed the Eu- 
cfidean distance, P(n|m) = for n < m. For n > m, the 
minimal path must use at least one short-cut. But tak- 
ing a short-cut is equivalent to randomizing the position 
along the network, since the short-cuts are uniformly dis- 
tributed. Hence, P{n\m) must be independent of n for all 
n > m. Finally, for n = m, it is not necessary to use any 
short-cuts in the minimal path; so the arguments invoked 
for n > TO do not apply. Instead, P(n\n) is determined 
by the constraint that the probability distribution is nor- 
malized. 

We now derive the general form of P{m\n). From ele- 
mentary probability theory, we have 



P{n\m) * Q{m) 



n< N 



-^P{m\n) ; N 



(1) 



From Eqn. (Q) and the properties discussed in the previ- 
ous paragraph, P{m < n\n) = f{m) is independent of n, 
and P(m > n\n) = 0. Thus the general form of P(rn\n) 
is 



P{m\n) = e{n ~ m)f{m) + [l - f(^">] 



(2) 
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FIG. 1. The values of f{m) as obtained from P{m\n), 
which is calculated from simulations for the n-values n — 5 
(o), n = 10 (□), n = 15 (*), and n = 500 (o). This figure 
confirms the statement that P(m<n|n) is independent of n. 
The parameters for the figure are L = 1000, p — 0.25. Similar 
confirmation has been obtained from simulations for a wide 
range of parameter values. 

where 8(x) is defined by Q{x) — for a; < and Q(x) — 
1 for X > 0. We have numerically confirnied the validity 
of this form, as shown in Fig. ^ 

The fact that P{m\n) is completely determined by 
/(to) has some surprising consequences, regardless of the 
exact form of f{m). To explore these consequences, we 
examine some other properties of small-world networks. 
For example, besides £, the quantity <£{n)>, which is the 
average minimal distance separating two sites with Eu- 
clidean distance n, has been discussed in the literature 
p^ , p^ . We can express both these quantities in terms of 
P{in\n) as follows: 



<iin)>-- 



mPlmln) 



(3) 



L 



^ N-l n N 

— [2 ^ ^ mP(m\n) + ^ mP{m\N) (4) 



Similar expressions hold for <£'^{n)>, <£^{n)>, and l"^. 
Substituting the form of P{m\n) [Eqn. (|^)] into the ex- 
pression for £ [Eqn. (13)] , we obtain 
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2E E™/M + 2E^^[i- E/(^ 



n—l m— 1 n— 1 



(5) 



+ ^TO/(m)+iV[l- ^/(m)] 

m— 1 m— 1 

which can be simplified to give the following exact ex- 
pression: 



FIG. 2. The average minimal separation, £ = Lg{x), vs. 
average number of short-cuts, x = pL, as obtained from (a) 
numerical simulation by averaging over all pairs (*); (b) nu- 
merical simulation using Eqn. H (o), and (c) Fade-fit as given 
by Ref. [12] (solid line). This confirms the exact expression 
Eqn. (0). 



£ = <£{N)> 1 + 
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Similarly, we also obtain an exact expression for the 
variance of the distribution of minimal distances, = 



= <£^{N)> (1 + 
1 



1 



+ 



3(L- 1 

<£{N)> (I 



L-\' 
<£{N)> -4 <£^{N)> 

1 <£^{N)>i2 



L-r 



L - 1 



(7) 



The surprising aspect of the above equations is that 2 
and CT^, which are average properties of the entire net- 
work, are completely determined by the mean separation 
of 'diametrically opposite sites' (d.o.s.), <£{N)>, and its 
higher moments <£'^{N)> and <^^(A^)>. This fact, in 
particular, makes for significant numerical simplification 
in the computation of £. Note that Eqns. (^ and (|^) 
can readily be generalized to any k by performing the 
substitution L l—^ \L/k~\ . 

When the network has exactly one short-cut, we can 
calculate J analytically using Eqn. (||). In this case, in the 
limit of large N, we get <£{N)>=jN and <£'^{N)>^^N'^ 
which gives J — -^N. As expected, this is in perfect 
agreement with the results obtained by Strang and Eriks- 
son |l^,^. We have further confirmed Eqn. (||) by 
numerically computing £ using the following two proce- 
dures: (i) Averaging the minimal distance over all pairs 
of sites, and (ii) considering only pairs of d.o.s. and 
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FIG. 3. This figure confirms the proposed scahng form of 
/(m, N,p) (Eqn. (p^) for x — pL = 10 and system sizes: (a) 
L = 500 (*), (b) L = 750 (o), and (c) L = 1000 (o). We have 
confirmed this scahng coUapse for a wide range of x- values. 

using Eqn. (^). The results, which are presented in Fig. 
^, indicate that the two procedures are equivalent. 

The results obtained so far have been independent of 
the functional form of f{m). To gain further insight, 
we consider the scaling properties of f{m), following the 
real-space renormalization group (RG) analysis of New- 
man and Watts . This procedure consists of blocking 
pairs of adjacent sites while preserving the total number 
of short-cuts in the network. This gives for the trans- 
formed lattice: N' = N/2 and p' = 2p. We note the 
following features of this transformation |^: (i) the ge- 
ometry of the minimal paths is unchanged in almost all 
cases, and the number of site-pairs for which the geome- 
try does change is negligible for large L and small p, and 
(ii) the distance along the minimal path is halved, i.e. 
to' = to/2 for large L and small p. Now, by definition 
/(to) = P{m\N) (to < N); thus to evaluate /(to) we 
need to calculate the average number of minimal paths 
of length TO between two d.o.s. in the network. Further- 
more we note that the RG-transformation maps two pairs 
of d.o.s. into a single pair. This fact, in conjunction with 
points (i) and (ii) above gives us 

/'(f ,f ,2p) = 2/(TO,iV,p) (8) 

For large N, taking the continuum limit, we can gener- 
alize the above expression to 

/'(^,^,Ap) = A/(TO,iV,p) (9) 

These observations can now be summarized in the fol- 
lowing scaling form: 



f{m,N,p) = ^h{y,x) (10) 

771 

where ^ ~ ^ ~ 2pN 

By fixing a;, we have observed the scaling collapse of /(to) 
for different values of N and p. This is demonstrated 
numerically for a; = 10 in Fig.|^. Our simulations indicate 
that for any given x, this scaling collapse holds for large 
enough N. 

It is interesting to note that the scaling properties of 1 
can be derived from the scaling form of f{m). Using the 
definition of £ (Eqn. (||)) and the scaling form for f{m), 
we get 

I=^j[l-j)ly{l~yfh{y,x)] (11) 

= L g{x) (12) 

which is consistent with the scaling form proposed in pre- 
vious works. Similar scaling forms hold for <£{N)>, 
and <e^{N)>. 

We now consider the limit of large system sizes such 
that x 3> 1. In this limit, we have observed numerically 
that we can approximate /(to) by a gaussian distribution 
function: 

f{m) = ^—e ^ (13) 

where and cr^ are respectively the mean and variance 
of the distribution. The corresponding fit for x = 250 
and X — 500 is shown in Fig. ^. Our simulations indicate 
that as X increases, {(J-g/crg) also increases, as can be seen 
from the figure. 

Using the gaussian approximation for f{m), we are 
now able to calculate the function <l{n)>, which has 
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FIG. 4. /(m) vs. mior x=pL = 250 and L = 2000. The 
solid line is the gaussian fit to the calculated data. The inset 
shows the gaussian fit for x = 500 and L — 2000. Note that 
with increasing x, the gaussian becomes more sharply peaked. 
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have derived some exact relations which will significantly 
aid computational efforts. We have obtained an approxi- 
mate scaling form for this probability distribution in the 
limit of large system sizes. It is our hope that further ef- 
forts along these lines will provide a better understanding 
of the structure of small- world networks. 
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and the Norwegian University of Science and Technology 
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FIG. 5. The mean distance <£{n)> between two sites hav- 
ing Euclidean separation n for x — pL — 250. Results are 
shown for: (a) numerical simulation (o), and (b) analytic ex- 
pression (Eqn. ([l4|)) (solid line). The analytic expression is 
an excellent fit for a; 2> 1. 



been discussed elsewhere 141^]. From Eqn. (||) and (|l 
in the limit /ig 3> Ug, we obtain 



<iin)> =n-i(n-^g)[$(!L^; 



2tt 



(14) 



where we have assumed $(/ig/-\/2(Tg) = 1. In this limit, 
substituting the above form of f{m) into the definitions 
of <^(iV)> and <f{N)>, we get 



= <£iN)> 
al = <f{N)> 



<£{N)>^ 



(15) 
(16) 



In particular, these equations imply that fig and ag 
have the following scaling forms: fig ^ L gi{x) and 
ag ^ L g2{x)- Using these relations, we see that the 
gaussian ansatz for /(m) (Eqn. (|l^)) is consistent with 
the scaling form proposed in Eqn. (p^. In Fig.||, we 
compare Eqn. (14) to results from our simulations for 
X = 250. In the limit L oo, we have {ug/ fig) — s- 0, 
which upon substitution into Eqn. ( p^ ) gives us 



<£(n)> = 



n 

<t{N)> 



n < 
n > 



<i{N)> 
<iiN)> 



(17) 
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This expression for <£{n)> is consistent with that derived 
in Ref. ||l^,|l^ in the same limit. 

In conclusion, we have studied the probability distri- 
bution for minimal path-lengths in small-world networks. 
We have presented arguments for the general analytical 
form this distribution must take. Using this form, we 
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